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(54) Self-protecting documents 

(57) A system and method for the secure distribution 
of electronic documents reduces the likelihood of unau- 
thorized reproduction and redistribution by either au- 
thorized or unauthorized recipients. A self-protecting 
document (SPD) contains an encrypted document as 



well as a secure set of permissions and the software 
necessary to process the document; full decryption of 
the document is performed as late as possible so as to 
minimize the possibility of intercepting the document be- 
fore it has been fully rendered to screen or to paper. 
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Description 

[0001] The invention relates to document rights management, and more particularly, to a self-protecting document 
scheme that enables electronic document protection without the need for additional software or hardware support for 
5 protection. 

[0002] One of the most important issues impeding the widespread distribution of digital documents via electronic 
commerce is the current lack of protection of the intellectual property rights of content owners during the distribution 
and use of those digital documents. Efforts to resolve this problem have been termed "Intellectual Property Rights 
Management" ("IPRM"), "Digital Property Rights Management" ("DPRM"), "Intellectual Property Management" ("IPM"), 

'0 "Rights Management" ("RM"), and "Electronic Copyright Management" ("ECM"). 

[0003] A document, as the term is used herein, is any unit of information subject to distribution or transfer, including 
but not limited to correspondence, books, magazines, journals, newspapers, other papers, software, photographs and 
other images, audio and video clips, and other multimedia presentations. A document may be embodied in printed 
form on paper, as digital data on a storage medium, or in any other known manner on a variety of media. 

15 [0004] In the world of printed documents, a work created by an author is usually provided to a publisher, which 
formats and prints numerous copies of the work. The copies are then sent by a distributor to bookstores or other retail 
outlets, from which the copies are purchased by end users. 

[0005] While the low quality of copying and the high cost of distributing printed material have served as deterrents 
to the illegally copying of most printed documents, it is far too easy to copy, modify, and redistribute unprotected elec- 
20 tronic documents. Accordingly, some method of protecting electronic documents is necessary to make it harder to 
illegally copy them. This will serve as a deterrent to copying, even if it is still possible, for example, to make hardcopies 
of printed documents and duplicate them the old-fashioned way. 

[0006] With printed documents, there is an additional step of digitizing the document before it can be redistributed 
electronically; this serves as a deterrent. Unfortunately, it has been widely recognized that there is no viable way to 
25 prevent people from making unauthorized distributions of electronic documents within current general-purpose com- 
puting and communications systems such as personal computers, workstations, and other devices connected over 
local area networks (LANs), intranets, and the Internet. Many attempts to provide hardware-based solutions to prevent 
unauthorized copying have proven to be unsuccessful. 

[0007] Two basic schemes have been employed to attempt to solve the document protection problem: secure con- 

30 tainers and trusted systems. 

[0008] A "secure container" (or simply an encrypted document) offers a way to keep document contents encrypted 
until a set of authorization conditions are met and some copyright terms are honored (e.g., payment for use). After the 
various conditions and terms are verified with the document provider, the document is released to the user in clear 
form. Commercial products such as IBM's Cryptolopes and InterTrust's Digiboxes fall into this category. Clearly, the 

35 secure container approach provides a solution to protecting the document during delivery over insecure channels, but 
does not provide any mechanism to prevent legitimate users from obtaining the clear document and then using and 
redistributing it in violation of content owners' intellectual property. 

[0009] Cryptographic mechanisms are typically used to encrypt (or "encipher') documents that are then distributed 
and stored publicly, and ultimately privately deciphered by authorized users. This provides a basic form of protection 
40 during document delivery from a document distributor to an intended user over a public network, as well as during 
document storage on an insecure medium. 

[0010] In the "trusted system" approach, the entire system is responsible for preventing unauthorized use and dis- 
tribution of the document. Building a trusted system usually entails introducing new hardware such as a secure proc- 
essor, secure storage and secure rendering devices. This also requires that all software applications that run on trusted 
45 systems be certified to be trusted. While building tamper-proof trusted systems is still a real challenge to existing 
technologies, current market trends suggest that open and untrusted systems such as PC's and workstations will be 
the dominant systems used to access copyrighted documents. In this sense, existing computing environments such 
as PC s and workstations equipped with popular operating systems (e.g., Windows and UNIX) and render applications 
(e.g., Microsoft Word) are not trusted systems and cannot be made trusted without significantly altering their architec- 

[0011] Accordingly, although certain trusted components can be deployed, one must continue to rely upon various 
unknown and untrusted elements and systems. On such systems, even if they are expected to be secure, unanticipated 
bugs and weaknesses are frequently found and exploited. 

[0012] There are a number of issues in rights management: authentication, authorization, accounting, payment and 
55 financial clearing, rights specification, rights verification, rights enforcement, and document protection. Document pro- 
tection is a particularly important issue. After a user has honored the rights of the content owner and has been permitted 
to perform a particular operation with a document (e.g., print it, view it on-screen, play the music, or execute the 
software), the document is presumably in-tbe-clear, or unencrypted. Simply stated, the document protection problem 



2 



EP 0 999 488 A2 



is to prevent the content owner's rights from being compromised when the document is in its most vulnerable state: 
stored, in the clear, on a machine within the user's control. Even when documents are securely delivered (typically in 
encrypted form) from a distributor to the user, it must be rendered to a presentation data form before the user can view 
or otherwise manipulate the document. Accordingly, to achieve the highest level of protection, it is important to protect 
the document contents as much as possible while revealing them to the user at a late stage and in a form that is difficult 
to recover into a useful form. 

[001 3] In the known approaches to electronic document distribution that employ encryption, an encrypted document 
is rendered in several separate steps. First, the encrypted document is received by the user. Second, the user employs 
his private key (in a public key cryptosystem) to decrypt the data and derive the document's clear content. Finally, the 
clear content is then passed on to a rendering application, which translates the computer-readable document into the 
finished document, either for viewing on the user's computer screen or for printing a hardcopy. The clear content is 
required for rendering because, in most cases, the rendering application is a third-party product (such as Microsoft 
Word or Adobe Acrobat Reader) that requires the input document to be in a specific format. It should be appreciated, 
then, that between the second and third steps, the previously protected document is vulnerable. It has been decrypted, 
but is still stored in clear electronic form on the user's computer. If the user is careless or is otherwise motivated to 
minimize fees, the document may be easily redistributed without acquiring the necessary permissions from the content 
owner. 

[0014] Accordingly, it would be beneficial to provide an electronic document distribution scheme that minimizes the 
disadvantages of known systems. Such a scheme would prevent users from obtaining a useful form of an electronically- 
distributed document during the decryption and rendering processes. 

[0015] The present self-protecting document ("SPD") is not subject to the above-stated disadvantages of the prior 
art. By combining an encrypted document with a set of permissions and an executable code segment that includes 
most of the software necessary to extract and use the encrypted document, the self-protecting document accomplishes 
protection of document contents without the need for additional hardware and software. 

[0016] The SPD system is broken down between a content creator (analogous to the author and the publisher of the 
traditional model) and a content distributor. The author/publisher creates the original document, and decides what 
rights are to be permitted. The distributor then customizes the document for use by various users, ensuring via the 
customization that the users do not exceed the permissions they purchased. 

[001 7] At the user's system, the self-protecting document is decrypted at the last possible moment. In an embodiment 
of the invention, various rendering facilities are also provided within the SPD, so that the use of the SPD need not rely 
upon external application that might not be trustworthy (and that might invite unauthorized use). In an alternative em- 
bodiment, interfaces and protocols are specified for a third-party rendering application to interact with the SPD to 
provide trusted rendering. 

[0018] In one embodiment of the invention, the encrypted document is decrypted by the user's system while simul- 
taneously "polarizing" it with a key that is dependent, at least in part, on the state of the user's system. The polarization 
may be cryptographically less secure than the encryption used for distribution, but serves to deter casual copying. In 
this embodiment, depolarization is performed during or after the rendering process, so as to cause any intermediate 
form of the document to be essentially unusable. 

FIGURE 1 is a top-level block diagram representing a model for the creation and commercial distribution of elec- 
tronic documents in either secure or insecure environments; 

FIGURE 2 is a flow diagram illustrating the decryption of protected electronic documents according to the art; 
FIGURE 3 is a flow diagram illustrating the decryption of protected electronic documents according to a simple 
embodiment of the invention; 

FIGURE 4 is a flow diagram illustrating the decryption of protected electronic documents according to a preferred 
embodiment of the invention; 

FIGURE 5 is a functional block diagram illustrating the data structures present in a self-protecting document ac- 
cording to an embodiment of the invention; 

FIGURE 6 is a flow diagram illustrating the creation and customization of a self-protecting document according to 
an embodiment of the invention; 

FIGURE 7 is a flow diagram, from a user's perspective, illustrating the actions performed in handling and using a 
self-protecting document according to the invention; 

FIGURE 8 is a graph illustrating several possible paths between an unrendered and encrypted document, and 
rendered and decrypted presentation data; 

FIGURE 9 is a flow diagram illustrating a polarization process according to the invention in which document format 
information remains in the clear for rendering. 



[0019] Figure 1 represents a top-level functional model for a system for the electronic distribution of documents, 
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which as defined above, may include correspondence, books, magazines, journals, newspapers, other papers, soft- 
ware, audio and video clips, and other multimedia presentations. 

[0020] An author (or publisher) 110 creates a document's original content 112 and passes it to a distributor 114 for 
distribution. Although it is contemplated that the author may also distribute documents directly, without involving another 
5 party as a distributor, the division of labor set forth in Figure 1 is more efficient, as it allows the author/publisher 110 to 
concentrate on content creation, and not the mechanical and mundane functions taken over by the distributor 114. 
Moreover, such a breakdown would allow the distributor 1 1 4 to realize economies of scale by associating with a number 
of authors and publishers (including the illustrated author/publisher 110). 

[0021] The distributor 114 then passes modified content 116 to a user 118. In a typical electronic distribution model, 
10 the modified content 116 represents an encrypted version of the original content 112; the distributor 114 encrypts the 
original content 112 with the user 118's public key, and modified content 116 is customized solely for the single user 
118. The user 118 is then able to use his private key to decrypt the modified content 116 and view the original content 
112. 

[0022] A payment 1 20 for the content 1 1 2 is passed from the user 1 1 8 to the distributor 1 1 4 by way of a clearinghouse 

'5 122. The clearinghouse 122 collects requests from the user 118 and from other users who wish to view a particular 
document. The clearinghouse 122 also collects payment information, such as debit transactions, credit card transac- 
tions, or other known electronic payment schemes, and forwards the collected users' payments as a payment batch 
124 to the distributor 114. Of course, it is expected that the clearinghouse 122 will retain a share of the user's payment 
120. In turn, the distributor 114 retains a portion of the payment batch 124 and forwards a payment 126 (including 

20 royalties) to the author and publisher 11 0. In one embodiment of this scheme, the distributor 114 awaits a bundle of 
user requests for a single document before sending anything out. When this is done, a single document with modified 
content 116 can be generated for decryption by all of the requesting users. This technique is well4<nown in the art. 
[0023] In the meantime, each time the user 118 requests (or uses) a document, an accounting message 128 is sent 
to an audit server 130. The audit server 130 ensures that each request by the user 118 matches with a document sent 

25 by the distributor 114; accounting information 131 is received by the audit server 130 directly from the distributor 114. 
Any inconsistencies are transmitted via a report 132 to the clearinghouse 122, which can then adjust the payment 
batches 124 made to the distributor 114. This accounting scheme is present to reduce the possibility of fraud in this 
electronic document distribution model, as well as to handle any time-dependent usage permissions that may result 
in charges that vary, depending on the duration or other extent of use. 

30 [0024] The foregoing model for electronic commerce in documents, shown in Figure 1 , is in common use today. As 
will be shown in detail below, it is equally applicable to the system and method set forth herein for the distribution of 
self-protecting documents. 

[0025] Turning now to Figure 2, the steps performed by the user 118 (Figure 1 ) in a prior art system for electronic 
document distribution are shown. As discussed above, cryptographic mechanisms are typically used to encipher doc- 
35 uments. Those encrypted documents are then distributed and stored publicly and deciphered privately by authorized 
users. This provides a basic form of protection during document delivery from a document distributor to an intended 
user over a public network, as well as during document storage on an insecure medium. 

[0026] At the outset, an encrypted document 210 is received by the user 118 and passed to a decryption step 212. 
As is well known in the art, the decryption step 212 receives the user 118's private key, which is stored locally at the 
40 user's computer or entered by the user when needed. The document 210 is decrypted, resulting in clear content 216 
similar or identical to the original content 112 (Figure 1). 

[0027] The clear content 216 is passed to a rendering application 218, which constructs presentation data 220, or 
a usable version of the document's original content 112. In typical systems of this kind, the presentation data 220 is 
data immediately suitable for display on a video screen, for printing as a hardcopy, or for other use depending on the 
45 document type. 

[0028] As discussed above, the document is vulnerable in systems like this. The clear content 216 can be copied, 
stored, or passed along to other users without the knowledge or consent of the distributor 114 or the author/publisher 
110. Even a legitimate user may be tempted to minimize the licensing fees by capturing the document in the clear in 
order to redistribute and use it at will, without honoring the intellectual property of the content owners. As discussed 
so above, the present invention is directed to a scheme for preventing such a user from obtaining a useful form of the 
document during the rendering process on the user's system. 

[0029] Accordingly, the system and method of the present invention sets forth an alternative scheme for handling 
encrypted documents at the user 118's system. A simple embodiment of this scheme is illustrated in Figure 3. 
[0030] Figure 3 looks similar to Figure 2, in that an encrypted document 310 is passed to a decryption step 312 
55 (which uses a private key 314) and a rendering application 316, resulting in presentation data 318. However, an addi- 
tional layer of protection is provided by a protecting shell 320. The protecting shell 320 allows the document 310 to be 
decrypted and rendered without ever leaving clear content (as in the clear content 216 of Figure 2) available to be 
intercepted. This is accomplished by including decryption and rendering elements within the document 310, as wiH be 
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described below with reference to Figure 5. The included decryption and rendering elements are adapted to limit the 
user's interaction with the SPD, prohibiting certain operations (such as saving the document or performing cut-and- 
paste operations) according to the user's permissions. 

[0031] Figure 4 is a more sophisticated version. The scheme of Figure 4 includes an intermediate "polarization" step 
s adapted to secure the document after it has been decrypted but before it is rendered. First, the encrypted document 
contents 41 0 are passed to a polarizer 41 2. The polarizer 41 2 receives the user's private key 414 and, via a decryption 
step 416, decrypts the document contents 410. Concurrently, the polarizer 412 receives a polarization key 418 from 
the user's system. 

[0032] This polarization key 41 8 is used by the polarizer 41 2 to transform the document to a version having polarized 
10 contents 420. All of these operations can take place in the open, without any kind of protective mechanism, provided 

the polarizer 412 does not store a clear version of the document between decrypting it and polarizing it. 

[0033] In one embodiment of the invention, the polarization key 418 represents a combination of data elements taken 

from the user's system's internal state, such as the date and time of day, elapsed time since the last keystroke, the 

processor's speed and serial number, and any other information that can be repeatably derived from the user's system. 
15 It is useful to include some time-derived information in the polarization key 418 so that interception and seizure of 

polarized contents 420 would not be useful. Further rendering of the polarized document would not be possible, as the 

system time would have changed too much. 

[0034] Then, once again within a protecting shell 422, the polarized contents 420 are passed to a rendering appli- 
cation 424. As discussed above, typical rendering applications are third-party applications such as Microsoft Word or 
20 Adobe Acrobat Reader. However, it is likely that such external rendering applications will not be able to process the 
polarized contents 420, as the contents, any formatting codes, and other cues used by the Tenderer will have been 
scrambled in the polarization process. 

[0035] Hence, the rendering application 424 must be commutative (or at least fault-tolerant), or it must receive po- 
larized contents 420 that are largely complete and processable by the application. The latter possibility will be discussed 

25 below, in connection with Figure 9. 

[0036] The output of the rendering application is polarized presentation data 426, which has been formatted by the 
rendering application 424 but is still polarized, and hence not readable by the user. The polarized presentation data 
426 is passed to a depolarizer 428, which receives the polarization key 418 and restores the original form of the 
document as presentation data 430. In one embodiment of the invention, the depolarization function is combined with 

so the rendering or display function. In this case, the polarized presentation data 426 is received directly by a display 
device, which can be separate from the user's system and receive data over a communications channel. 
[0037] Creation of the polarization key 418, the rendering application 418, and the depolarization step 428 are all 
elements of the protecting shell 422; these are tamper-resistant program elements. It is contemplated that all compu- 
tational steps that occur within the protecting shell 422 will use local data only, and will not store temporary data to any 

35 globally accessible storage medium or memory area; only the explicit results will be exported from the protecting shell 
422. This approach will prevent users from easily modifying operating system entry points or scavenging system re- 
sources so as to intercept and utilize intermediate data. 

[0038] It should be noted that the presentation data 430 of Figure 4, in alternative embodiments of the invention, can 
be either device independent or device dependent. In the device-independent case, additional processing by a device 
40 driver (such as a display driver or a printer driver) typically is necessary to complete the rendering process. In the 
presently preferred device-dependent case, the device-specific modifications to the presentation data have already 
been made (either in the rendering application 424 or the depolarizing step 428), and the presentation data 430 can 
be sent directly to the desired output device. 

[0039] The decryption schemes described with reference to Figures 3 and 4 above are enabled by a unique document 
45 structure, which is shown in detail in Figure 5. As discussed above, certain operations performed by the system and 
method of the invention require trusted components. One way to ensure that certain unmodified code is being used to 
perform the trusted aspects of the invention is to provide the code along with the documents. The various components 
of a self-protecting document according to the invention are illustrated in Figure 5. 

[0040] The problem of document protection is approached by the invention without any assumptions on the presence 
50 of trusted hardware units or software modules in the user's system. This is accomplished by enhancing a document 
to be an active meta-document object. Content owners (i.e., authors or publishers) attach rights to a document that 
specify the types of uses, the necessary authorizations and the associated fees, and a software module that enforces 
the permissions granted to the user. This combination of the document, the associated rights, and the attached software 
modules that enforce the rights is the self-protecting document ("SPD") of the invention. A self-protecting document 
55 prevents the unauthorized and uncontrolled use and distribution of the document, thereby protecting the rights of the 
content owners. 

[0041] The self-protecting document 510 includes three major functional segments: an executable code segment 
512 contains certain portions of executable code necessary to enable the user to use the encrypted document; a rights 
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and permissions segment 514 contains data structures representative of the various levels of access that are to be 
permitted to various users; and a content segment 516 includes the encrypted content 116 (Figure 1 ) sought to be 
viewed by the user. 

[0042] In a preferred embodiment of the invention, the content segment 516 of the SPD 510 includes three subsec- 
5 tions: document meta-information 518 (including but not limited to the document's title, format, and revision date), rights 
label information 520 (such as a copyright notice attached to the text, as well as rights and permissions information), 
and the protected content 520 (the encrypted document itself). 

[0043] In one embodiment of the invention, the rights and permissions segment 514 includes information on each 
authorized user's specific rights. A list of terms and conditions may be attached to each usage right. For example, user 
10 John Doe may be given the right to view a particular document and to print it twice, at a cost of $10. In this case, the 
rights and permissions segment 514 identifies John Doe, associates two rights with him (a viewing right and a printing 
right), and specifies terms and conditions including the price ($10) and a limitation on printing (twice). The rights and 
permissions segment 514 may also include information on other users. 

[0044] In an alternative embodiment, the rights and permissions segment 514 includes only a link to external infor- 
ms mation specifying rights information. In such a case, the actual rights and permissions are stored elsewhere, for example 
on a networked permission server, which must be queried each time the document is to be used. This approach provides 
the advantage that rights and permissions may be updated dynamically by the content owners. For example, the price 
for a view may be increased, or a user's rights may be terminated if unauthorized use has been detected. 
[0045] In either scenario, the rights and permissions segment 514 is cryptographically signed (by methods known 
20 in the art) to prevent tampering with the specified rights and permissions; it may also be encrypted to prevent the user 
from directly viewing the rights and permissions of himself and others. 

[0046] The executable code segment 512, also called the "SPD Control," also contains several subsections, each 
of which comprises a software module at least partially within the executable code segment. In one embodiment of 
the invention, the Java programming language is used for the SPD Control; however, it is contemplated that any plat- 
25 form-independent or platform-specific language, either interpreted or compiled, can be used in an implementation of 
this invention. 

[0047] A rights enforcer 524 is present to verify the users identity, to compare a requested action by the user to those 
actions enumerated in the rights and permissions segment 514, and to permit or deny the requested action depending 
on the specified rights. The operation of the rights enforcer 524 will be discussed in further detail below, in connection 
30 with Figure 7. 

[0048] A secured polarization engine 526 is also present within the executable code segment 51 2; it serves to read 
and polarize the data according to the system state (or other polarization key) as discussed above. In a preferred 
embodiment of the invention, the polarization engine 526 acts upon the document before it is stored or decrypted, so 
the document is never stored in the clear on the user's system. The polarization engine 526 is secured, that is, it is 

35 cryptographically signed and encrypted, to prevent tampering, reverse-engineering, and disassembling. 

[0049] A counterpart depolarization engine 528 is also included to enable the generation of clear presentation data 
from the polarized content (see Figure 4). The depolarization engine includes a set of secure window objects, providing 
a relatively tamper-proof interface to the rendering API (application program interface) of the user's system. The secure 
window objects are resistant to being intercepted, thereby reducing the possibility that the document, in its clear form, 

to can be reconstructed by intercepting and receiving the data intended for the operating system. 

[0050] A counterpart depolarization engine 528 is also included to enable the generation of clear presentation data 
from the polarized content (see Figure 4). The depolarization engine 528 provides a relatively tamper-proof interface 
to the logical or physical output device (e.g., the user's display device). The input to the depolarization engine 528 is 
polarized presentation data. Therefore, if that data is intercepted, it will not reveal any of the clear content without 

•<5 further depolarization which depends on, for example, the user's system state. 

[0051] A secure viewer 530 is optionally included in the executable code segment 512. The secure viewer 530 is 
used to permit only those levels of access that are permitted according to the rights and permissions segment 514. 
For example, if the user purchased only sufficient rights to view a document (and not to save or print it), the viewer will 
not permit the user to save, print, or perform the standard cut-and-paste operations possible in most modern operating 

so systems. 

[0052] Finally, a rendering engine 532 is included or referenced within the executable code segment 512. The ren- 
dering engine 532 need not be secure. Accordingly, the code for the rendering engine 532 can be included within the 
SPD applet, or alternatively retrieved (via a secure link) from some other location. In either case, the rendering engine 
532 is adapted to receive polarized document contents and produced polarized presentation data therefrom (see Figure 

55 4). 

[0053] The foregoing aspects and elements of the self-protecting document 510 will be discussed in further detail 
below, in conjunction with the operation of the system. 

[0054] Figure 6 shows the steps performed when a self-protecting document 5 1 0 is created and d istributed . A generic 
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SPD 610 includes no user-specific rights information and is not encrypted for any particular user. The generic SPD 
610 is created from three items: the original document content 612, in clear (unencrypted) form; a high-level rights 
specification 614; and an optional watermark 616. 

[0055] The content 612 is pre-processed (step 618) to lay out the document as desired by the author or publisher. 

5 For example, a preferred page size, font, and page layout may be selected. The content 612 is essentially "pre-ren- 
dered" in the content pre-processing step so that it will be in a format that is compatible with users' systems and the 
SPD. For example, the content 61 2 may be converted from Microsoft Word (".DOC") or Adobe Acrobat (".PDF") format 
to a different format specially adapted to be read by the rendering engine 532 (Figure 5). In one embodiment of the 
invention, multiple versions of the content 612 are generated by the content pre-processing step and stored in the 

10 generic SPD 610; those different versions may then be separately purchased by the user according to his needs. 

[0056] The high-level rights specification 614 sets forth what combinations of access rights are permissible. Such a 
rights specification is tailored to a particular document, and is capable of describing different groups of rights for different 
classes of downstream users. For example, a publisher may be given the right to distribute up to 100,000 copies of a 
document at a $1 .00 per copy royalty, with additional copies yielding a $2.00 royalty. Similarly, users may be given the 

'5 option to purchase a version of the document that "times out" after one month, one year, or never. Several possible 
limitations are described with reference to a detailed example, which is set forth below. 

[0057] Digital Property Rights Language (DPRL) is a language that can be used to specify rights for digital works. It 
provides a mechanism in which different terms and conditions can be specified and enforced for rights. Rights speci- 
fications are represented as statements in DPRL. For details, see, for example, U.S. Patent No. 5,715,403 to Stefik, 
20 entitled "System for Controlling the Distribution and Use of Digital Works Having Attached Usage Rights Where the 
Usage Rights are Defined by a Usage Rights Grammar." Enforcement of rights and verification of conditions associated 
with rights is performed using the SPD technology. 

[0058] Different rights can be specified for different parts of a digital work using a "work" specification. Within a work 
specification, different sets of rights applicable to this work are specified. Rights can be grouped into named-groups 

25 called "rights groups". Each right within a rights group is associated with a set of conditions. Conditions can be of 
different types: fee to be paid, time of use, type of access, type of watermark, type of device on which the operation 
can be performed, and so on. DPRL allows different categories of rights: transfer, render rights, derivative work rights, 
file management rights and configuration rights. Transport rights govern the movement of a work from one repository 
to another. Render rights govern the printing and display of a work, or more generally, the transmission of a work 

30 through a transducer to an external medium (this includes the "export" right, which can be used to make copies in the 
clear). Derivative work rights govern the reuse of a work in creating new works. File management rights govern making 
and restoring backup copies. Finally, configuration rights refer to the installation of software in repositories.{XE "rights: 
categories of} 

[0059] An exemplary work specification in DPRL is set forth below: 
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(Work: 

(Rights-Language-Version: 1.02) 

(Work-ID: "ISDN-1-55860-166-X; AAP-2348957tut") 

(Description: "Title: 'Zuke-Zack, the Moby Dog Story' 

Author: 'John Beagle' 

Copyright 1994 Jones Publishing") 
(Owner: (Certificate: 

(Authority: "Library of Congress") 
(ID: "Murphy Publishers"))) 
(Parts: "Photo-Celebshots-Dogs-23487gfj" "Dog-Breeds-Chart-AKC") 
(Comment: "Rights edited by Pete Jones. June 1996 ") 
(Contents: (From: 1) (To: 16636)) 
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(Rights-Group: "Regular" 

(Comment: "This set of rights is used for standard retail editions ") 
(Bundle: 

(Time: (Until: 1998/01/01 0:01)) 

(Fee: (To: "Jones-PBLSH-18546789")(House: "Visa"))) 

(Play: 

(Fee: (Metered: (Rate: 1.00 USD) (Per: 1:0:0) (By: 0:0:1)) )) 

(Print: 

(Fee: (Per-Use: 10 00 USD)) 
(Printer: 

(Certificate: 

(Authority: "DPT" 

(Type: "TrustedPrinter-6"))) 

(Watermark: 

(Watermark-Str: "Title: Zeke Zack - the Moby Dog' 

Copyright 1994 

by Zeke Jones. 

All Rights Reserved.") 
(Watermark-Tokens: user-id institution-location 

render-name render-time) ))) 

(Transfer: ) 

(Copy:(Fee: (Per-Use: 10.00 USD) )) 
(Copy: (Access: 

(User: (Certificate: 

(Authority: "Murphy Publishers") 

(Type: "Distributor))))) 

(Delete:) 
(Backup:) 

(Restore: (Fee: (Per-Use: 5.00 USD))) )) 



[0060] This work specification has a rights group called "Regular," which specifies rights for standard retail editions 
of a book titled "Zuke-Zack, the Mot>y Dog Story." The work specification expresses conditions for several rights: play, 
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print, transfer, copy, delete, backup, and restore. The work in the example includes two other parts, a photograph and 
a chart of breeds incorporated from other sources. A "bundle" specification bundles a set of common conditions that 
apply to all rights in the group. This specification states that all rights in the group are valid until January 1 , 1998 and 
that the fee should be paid to account "Jones-PBLSH-1 8546789". The clearinghouse for this transaction should be 

s Visa. The following contract applies: the work can be played by paying $1 .00 every hour, where fee is accumulated by 
the second; the work can be printed on TrustedPrinter-6 which is certified by "DPT" for a fee of $10.00 per print; the 
printed copy should have a watermark string (as depicted) and a list of tokens signifying "fingerprint" information known 
at the time it is printed; this work can be copied either by paying $10.00 or by acquiring a distributor certificate from 
Murphy publishing; and unrestricted transfer, deletion or backing up of this work is permitted (restoration costs $5.00). 

10 [0061] The high-level rights specification 614 is also subject to a pre-processing step (step 620), in which the high- 
level (i.e., human-readable) specification is compiled into a more-efficient data structure representation for use by the 
invention. 

[0062] The generic SPD 610 is then created (step 622) by combining the pre-processed content 612, the pre-proc- 
essed rights specification 614, and the watermark 616. A watermark may be added by any means known in the art; it 
'5 may be either visible or concealed within the SPD. The generic SPD 610 may also optionally be encrypted by the 
author/publisher 110 for transmission to the distributor 114 (Figure 1). 

[0063] The generic SPD 610 is then received by the distributor 114, and is stored for later customization. When a 
user request 624 is received by the distributor 114 (either directly or through the clearinghouse 122 or other interme- 
diary), the distributor 1 1 4 creates a set of user permissions (step 626) that is consistent with both the user request 624 
20 and the rights specification 614. If there is no such consistent set of permissions, then no further action is performed 
on that user's behalf (other than an optional notification message to the user). 

[0064] The user permissions and the user's public key 628 are then used to generate (step 630) a customized SPD 
632 adapted to be used by the user. The user permissions from step 626 are stored in the rights and permissions 
segment 514 of the SPD 632, and the user's public key 628 is used to encrypt the content in the content segment 516 

25 of the SPD 632. A public-key encryption mechanism can be used to transform the SPD from the generic form to the 
customized SPD 632. Such a mechanism is useful if the SPD has to be confidentially transferred between different 
parties, e.g., author to publisher to retailer to consumer, with rights protection at each stage. It should further be noted 
that multiple user requests can be composed and accommodated within a single SPD 632; there are techniques known 
in the art that are capable of using multiple public keys to encrypt a document such that any of the users' private keys 

30 can be used to decrypt it. 

[0065] The resulting custom SPD 632 is then transmitted to the user 118 by any available means, such as via a 
computer network or stored on a physical medium (such as a magnetic or optical disk). 

[0066] The operations performed when a user receives an SPD are depicted in the flow diagram of Figure 7. The 
SPD is first received and stored at the user's system (step 710); in many cases, it is not necessary to use the SPD 

35 right away. When usage is desired, the user is first authenticated (step 71 2), typically with a user name and a password 
or key. The system then determines what action is desired by the user (step 714). When an action is chosen, the rights- 
enforcement step of the invention (step 716) verifies the conditions associated with the desired action (such as the 
fee, time, level of access, watermark, or other conditions); this can be performed locally via the SPD applet 512 (Figure 
5) or by accessing a rights enforcement server. 

40 [0067] If the rights enforcement step (step 716) fails, an update procedure (step 718) is undertaken. The user may 
choose to update his permissions, for example by authorizing additional fees. After the satisfactory verification of con- 
ditions, a pre-audit procedure (step 718) is performed, in which the SPD system logs verification status to a tracking 
service (e.g., the audit server 130 of Figure 1). The content is then securely rendered to the screen (step 722) as 
discussed above. When the user is finished, a post-audit procedure (step 724) is performed in which the amount of 

■« usage is updated with the tracking service. The SPD system then awaits further action. 

[0068] The protection yielded by the SPD is derived from the user's inability to capture a useful form of the document 
at any intermediate stage during the rendering process. This is accomplished by decrypting the document contents to 
a clear form at the latest possible stage, ideally in the last step. 

[0069] The SPD decryption model is illustrated in Figure 8. E denotes the encryption function performed by the 
50 publisher; D denotes the decryption performed at the user's system, and R denotes the rendering transformation. Many 
prior systems use a first sequence of transformations 810, D(E(x)) followed by R(D(E(x))). As stated previously, the 
early decryption leaves the document in a vulnerable state. Ideally, the transformations are performed in the reverse 
order 812, R'(E(x)) followed by D(R'(E(x))). This postpones decryption to the latest possible time. 
[0070] The existence of R', a rendering operation that can be performed before decryption, is determined by the 
55 following equality: 

D(R'(E(x))) = R(D(E(x))) 
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In case that the encryption and decryption functions are commutative, that is, E(D(x)) = D(E(x)) for any x, the existence 
of R' is ensured: 



5 R'(y) = E(R(D(y)))fory = E(x) 

In practice, encryption and decryption functions in popular public-key cryptographic systems such as the RSA system 
and EIGamal discrete logarithm system satisfy the commutation requirement. This means that the transformation R' 
exists if these cryptographic systems are used for encryption and decryption. 

10 [0071] The path x' = D(R'(E(x))) portrays an ideal SPD solution to the document protection against unauthorized 
document usage and distribution. A scenario of distributing and using a document can be described as follows. When 
a user purchases the document, the document is encrypted using a user's public information and is transmitted over 
an insecure network channel such as the Internet. The encrypted document has the rights information attached to it 
and a protecting applet 512 that enforces the rights and permissions granted to the user by the content owner. Upon 

15 a user's request on using the document, the applet verifies the rights and permissions and generates from the encrypted 
document the presentation format of the original document. As any intermediate form of the document before the final 
presentation data is encrypted with the user's private information, the SPD model of document protection ensures that 
any intermediate form of the document is not useful to other systems wherever it is intercepted. 
[0072] Clearly, this ideal model relies on whether or not the transformation R' that corresponds to the rendering 

20 transformation Rcan be computed efficiently, and in particular on whether or not an invocation of the decryption function 
D is necessary during an implementation of R'. A trivial case in which R' can be implemented efficiently is where R is 
commutative with the encryption function E. When this happens, 

25 R '(V) = E(R(D(y))) = R(E(D(y))) = R(y) 

for y = E(x). In this case, R' = R. 

[0073] Consideration of Figure 8 reveals that many intermediate solutions (e.g., intermediate solutions 814, 816, 
and 818) to the document protection problem may exist on the user's system between the two extremes x' = R(D(E 
30 (x))), which has no protection on x = D(E(x)), and x' = D(R'(E(x))), which has ideal protection (under the assumptions 
set forth above). As depicted in Figure 8, one may consider different paths from the encrypted document E(x) to the 
presentation data x' that correspond to different combinations of partial rendering transformations and partial decryption 
transformations. Again, it should be recognized that delaying the decryption D in any path increases the protection 
level to the document. 

35 [0074] As discussed above, one alternative method of delaying decryption to the last possible moment employs a 
polarization technique that encrypts only the document contents, not the format or the entire document as a whole. 
This possibility is shown in Figure 9. Beginning with the clear document content 910 (which, it should be noted, does 
not exist in any single identifiable location during the user's processing, but is rather a transient state occurring within 
step 412 of Figure 4), the document is split (step 912) into a data portion 914 and a format portion 916. The data portion 
40 9 1 4 is polarized (step 918) using the polarization key 920 and merged (step 922) with the clear format portion 916. 
This results in polarized content 924 that can be rendered to polarized presentation data without first decrypting the 
content. It should be observed that this form of polarization is likely less secure than wholesale encryption with the 
polarization key, since a lot of information can potentially be derived from the layout of a document, word lengths, line 
lengths, etc.; however, this scheme will present a useful deterrent to casual copyright infringement. 
45 [0075] While certain exemplary embodiments of the invention have been described in detail above, it should be 
recognized that other forms, alternatives, modifications, versions and variations of the invention are equally operative 
and would be apparent to those skilled in the art. For example, the portions of the invention described above that are 
described as software components could be implemented as hardware. Moreover, while certain functional blocks are 
described herein as separate and independent from each other, these functional blocks can be consolidated and per- 
se formed on a single general-purpose computer, or further broken down into subfunctions as recognized in the art. 

[0076] According to a preferred embodiment of the method for creating a self-protecting document, the modifying 
step further comprises the step of pre-processing the unencrypted document. In addition, it advantageously further 
comprises the step of customizing the generic self-protecting document. 

[0077] The customizing step preferably comprises the substeps of receiving a user's document request, receiving 
55 the user's public key, creating a rights and permissions segment consistent with the user's document request and the 
rights specification, encrypting the original content segment to produce an encrypted content segment and combining 
the code segment, the rights and permissions segment and the encrypted content segment to produce a customized 
setf-proteeting document. 
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[0078] According to a preferred embodiment of the method for using a self-protecting document having an encrypted 
content segment on a user's system, the step of modifying the encrypted content segment comprises the substep of 
transforming the encrypted content segment via an encryption algorithm and employing the polarization key. Further 
the re-encryption step comprises the substeps of identifying data information and format information within the en- 
crypted content segment, separating the data information and the format information from the encrypted content seg- 
ment, encrypting the data information with the polarization key, and combining the encrypted data information with the 
format information to produce the polarized content. 

[0079] According to an advantageous embodiment, the polarization key comprises a combination of state information 
derived from the user's system. 



1. A self-protecting document embodied as data stored on a tangible storage medium, the self-protecting document 
comprising: 

an encrypted content segment containing data representative of document contents; 
a permissions segment; and 
a code segment. 

2. The self-protecting document of claim 1 , wherein the code segment comprises: 

a rights-enforcement subsection; and 
a rendering subsection. 

3. The self-protecting document of claim 1 , wherein the code segment further comprises a polarization subsection. 

4. The self-protecting document of claim 3, wherein the polarization subsection comprises a polarization engine and 
a depolarization engine. 

5. The self-protecting document of claim 1 , wherein the polarization subsection contains executable computer code 
adapted to modify the encrypted content segment. 

6. The self-protecting document of claim 1 , wherein the polarization subsection contains executable computer code 
adapted to modify the encrypted content segment into a polarized content segment. 

7. A method for creating a self-protecting document, comprising the steps of: 

receiving an unencrypted document; 

modifying the unencrypted document to produce an original content segment; 
creating a rights specification; 
creating a code segment; and 

combining the original content segment, the rights specification, and the code segment to produce a generic 
self-protecting document. 

8. The method of claim 7, wherein the modifying step comprises the step of encrypting the unencrypted document. 

9. A method for using a self-protecting document having an encrypted content segment on a user's system, com- 
prising the steps of: 

obtaining a polarization key; 

modifying the encrypted content segment with the polarization key to produce to produce polarized content; 
rendering the polarized content to produce rendered polarized content for output on an output device; 
depolarizing the rendered polarized content with the polarization key to produce rendered clear content; and 
sending the rendered clear content to the output device. 

10. The method of claim 9, wherein the step of modifying the encrypted content segment comprises the substep of 
transforming the encrypted content segment via an encryption algorithm employing the polarization key. 
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